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o 

<^^> Lattice field theory with the 9 term suffers from the sign problem. The sign problem 

appears as flattening of the free energy. As an alternative to the conventional method, the 
i— i ' Fourier transform method (FTM), we apply the maximum entropy method (MEM) to Monte 

Carlo data obtained using the CP 3 model with the 9 term. For data without flattening, we 
. obtain the most probable images of the partition function 2(9) with rather small errors. The 

• results are quantitatively close to the result obtained with the FTM. Motivated by this fact, 
we systematically investigate flattening in terms of the MEM. Obtained images 2(9) are 

■ consistent with the FTM for small values of 9, while the behavior of 2(9) depends strongly 

^ ' on the default model for large values of 9. This behavior of 2(9) reflects the flattening 

^\ , phenomenon. 

o ■ 

o : 

CN ' §1. Introduction 

o 

. In QCD and the CP^ -1 model, topologically non-trivial configurations play im- 

portant roles in determining dynamical properties and the vacuum structure. The 
effect of these configurations is introduced into the action with a 9 term. The exis- 
tence of the 9 term is associated with several interesting topics, such as the strong 
CP problem and possible rich phase structures in 9 space. It was pointed out by 't 

• HooftP that a color magnetic monopole becomes a dyon-like object in regions for 
. which 9^0, and this could result in the appearance of new phase structure. It 

was also shown in Ref. [2J) that a new phase could emerge in the Z(N) model. In 



the CP^ 1 model, it is known that there is a first order phase transition point at 



9 = 7T in the strong coupling region. To obtain a comprehensive understanding of 
the phase structure in 9 space, it is necessary to analyze the phase structure in the 
weak coupling region. 

A numerical simulation based upon the importance sampling method is one of 
the most promising tools to study non-perturbative properties of field theories. This 
method, however, is confronted with difficulties in the case of theories possessing the 
9 term, because this term causes the Boltzmann weight to be complex. This is the 
"complex action problem" or the "sign problem" . A conventional way to circumvent 
this problem is to calculate the partition function 2(9) by Fourier-transforming 
the topological charge distribution P(Q), which is calculated with the real positive 
Boltzmann weight We call this the "Fourier transform method" (FTM). 
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Although this approach works well for small lattice volumes and/or in the strong 
coupling region, it does not work for large volumes and/or in the weak coupling 
region, due to flattening of the free energy density f(0). This flattening phenomenon 
results from the error in P(Q) obtained using Monte Carlo (MC) simulations and 
leads to a spurious phase transition for 9 = 9{(< n). This is the sign problem. To 
overcome this problem requires exponentially increasing statistics. 

As an alternative approach to the FTM,®^ we have applied the maximum en- 
tropy method (MEM) to the treatment of the sign problem. This method is based 
upon Bayes' theorem, and it has been widely used in various fields ED JSJ 
The MEM gives the most probable images by utilizing sets of data and our knowl- 
edge about these images. The probability distribution function, called the posterior 
probability, is given by the product of the likelihood function and the prior probabil- 
ity. The latter represents our knowledge about the image and the former indicates 
how data points are distributed around the true values. The prior probability is 
given as an entropy term, which plays the essential role to guarantee the uniqueness 
of the solution. 

In order to investigate whether the MEM is applicable to the sign problem, 
we applied it to mock data, which were prepared by adding Gaussian noise to a 
model. In Ref. I21|) . a Gaussian P(Q) was used. The corresponding free energy 
can be calculated analytically using the Poisson sum formula. As mock data, data 
with flattening and without flattening were prepared. We found that in both cases, 
the MEM reproduced smooth f(9). The values of obtained f(9) are in agreement 
with exact ones and the errors are reasonably small compared to those resulting 
when using the Fourier transform. These results might lead one to believe that the 
MEM has the effect of smoothing data and that, for this reason, it is not a suitable 
technique for detecting singular behaviors, such as phase transitions. To determine 
whether this is indeed the case, we analyzed some toy models that exhibit singular 
behavior originating from the characteristics of the models themselves P*» We found 
in Ref. 22) that in fact, the MEM can detect such singular behavior. 

In the present paper, we apply the MEM to MC data of the CP 3 model. In order 
to check whether the MEM can treat real data in the 9 term, data without flattening 
are used. Next, we investigate how the flattening phenomenon is observed within 
the MEM. In the MEM analysis, it is necessary to give prior information. Generally, 
an obtained image depends on this prior information. We systematically investigate 
the influence of this information on the most probable image. The uncertainty in 
the most probable image is estimated as an error. We also check the effectiveness of 
the MEM by considering this error. 

This paper is organized as follows. In the following section, we summarize the 
formulations used in this work. Numerical results are presented in In that section, 
we investigate in detail the behavior of the obtained most probable images. In the 
final section, conclusions and discussion are presented. 
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§2. Formulations 

2.1. Topological charge distribution 

The lattice action of the CP^ -1 model with the 9 term is defined by 

S e (z,z)=S(z,z)-i9Q(z,z), (2-1) 

where S is a lattice action and Q is a topological charge. Complex scalar fields of 
the model are denoted by z and z, where z is the complex conjugate of z. 
We choose an integer- valued topological charge, 23 ' 



2 * □ 

where the plaquette contribution A\j is given by 

A a = -^{Ap(n)+A v (n + fi-Ap(n + v)-A v (n)}. (mod 2vr) (2-3) 

Here we have A^(n) = axg[z(n)z(n+p)], and these quantities satisfy A^n) G [— 7T, tt]. 

As the conventional method for avoiding the complex Boltzmann weight, the 
partition function 2(9) is calculated by Fourier-transforming the topological charge 
distribution P(Q): 

Z(9) = Y, p (Q)e i0Q - (2-4) 

Q 

The distribution P(Q) is given by 

\\dzdz\ Q e- s ^ , . 

which is calculated with a real positive Boltzmann weight. The measure [dz<iz]Q 
expresses the meaning that the integral is restricted to configurations of z and z 
with Q. Note that P(Q) is normalized as P(Q) = 1. 

We update configurations with the combined use of the overrelaxation and 
Metropolis algorithms. From the generated configurations, we measure Q and con- 
struct a histogram by counting the number of configurations with Q. Because the 
P(Q) under consideration decreases rapidly as a function of Q, it is convenient to 
use the set method,^ in which an entire range of values of Q is divided into sets 
S% (i = 1, 2, 3, • • • ). In the present study, each of the sets Si consists of 4 bins, 
Q = 3i — 3, 3i — 2, 3i — 1, and 3i, and thus each set shares its two external bins 
with the adjacent sets. Explicitly, the shared bins are Q = 3k (k = 1, 2, 3, • • • ). In 
order to generate configurations more effectively and to reduce errors, the action is 
modified by adding a trial function Pt(Q) satisfying 

S eff = S -In P t (Q). (2-6) 

The form of Pt(Q) is chosen as Pt(Q) oc e~ a( °^ in the present study, where a is 
adjusted so that P(Q) becomes almost flat in any given set, in order to reduce 
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errors. The power 2 of Q in Pt{Q) is varied in a manner depending on the coupling 
constant.^ 

For the lattice action, we use a fixed point action (FP action/251 j n order to reduce 
lattice artifacts. Because our simulations require a large number of measurements 
due to the 9 term, we employ 9 coupling constants, which are limited to a short 
range and lie within one plaquette. With this action, it was shown in Ref. 125)1 
that the lattice artifact is negligible up to somewhat small coupling constant, which 
corresponds to a correlation length of several units of the lattice spacing. We refer 
the reader to Ref. I25|) for actual values of the coupling constants. 



2.2. Flattening of the free energy density 

The free energy density f{9) is calculated by Fourier-transforming P(Q) obtained 
by MC simulation. The quantity f(9) is defined as 

f(9) = ~lnJ2 P (Qy 0Q , (2-7) 
Q 

where V = L 2 , the square of the lattice size. 

The MC data for P(Q) consist of the true value, P(Q), and its error, AP(Q). 
When the error at Q = dominates because of the exponential damping of P(Q), 
f{9) is closely approximated by 

f(Q) - ~\ In \f v]ip) + AP(0)] , (2-8) 

where f{6) is the true value of f(6). Because f{6) is an increasing function of 6, as 
in the case of f(9) derived from the Gaussian P{Q) in the strong coupling region, 
AP(0) dominates for large values of 9. If |AP(0)| ~ e" y ^ e ) at 9 = 9 { , then f{9) 
becomes almost flat for 9 > 9f. This is called "flattening of the free energy density", 
and it has been misleadingly identified as a first order phase transition, because the 
first derivative of f{9) appears to jump at 9 = 9{. To avoid this problem, we must 
carry out a very large number of measurements in the FTM; indeed, the order of e v 
measurements are needed. 

2.3. MEM formalism 

In this subsection, we briefly explain the MEM in terms of the 9 term. ( For 
details, see Ref. E3J. ) 

In a parameter inference, such as the x 2 fitting, the inverse Fourier transform 

—Z{0)e- ie ® (2-9) 

is used. In the numerical calculations, we use the discretized version of Eq. (|2-9|) ; 
P(Q) = ^2 n KQ t1l Z n , where Kq jH is the Fourier integral kernel and Z n = Z(9 n ). In 
order for the continuous function Z{9) to be reconstructed, a sufficient number of 
values of 9, which we denote by Ng, is required so that the relation Ng > Nq holds, 
where Nq represents the number of data points in P(Q) (Q = 0, Nq — 1). A 
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straightforward application of the x 2 fitting to the case Nq > Nq leads to degenerate 
solutions. This is an ill-posed problem. 

The maximum entropy method is one promising tool to solve this ill-posed prob- 
lem, and it gives a unique solution. The MEM is based upon Bayes' theorem, ex- 
pressed as 

pr ob(* W |P(Q),/) - MQ) ™$*f {Zm \ (MO, 

where piob(A\B) is the conditional probability that A occurs under the condition 
that B occurs. The posterior probability prob(.Z(0)|P(Q), J) is the probability that 
the partition function Z(9) is realized when the MC data {P(Q)} and prior infor- 
mation / are given. The likelihood function prob(P(Q)\Z(9), I) is given by 

prob(P(Q)|Z(0),/) = ^e"K (2-11) 

where Xl is a normalization constant and x 2 1S a standard x 2 function. 

The probability prob(Z(9)\I), which guarantees the uniqueness of the solution, 
is given in terms of an entropy S as 

pT ob(Z(9)\I) = -±—e- aS , (2-12) 
X s (a) 

where a and Xs{ct) are a positive parameter and an a-dependent normalization con- 
stant, respectively. As S, the Shannon-Jaynes entropy is conventionally employed: 



Ng r 

■, — m n — Z n In — 
m r 



n=l 



(2-13) 



Here m n = m(9 n ) represents a default model. 

The posterior probability pTob(Z n \P(Q), I), thus, is given by 

1 i 2 e w \ z \ 

pmb(Z n \P(Q),I,a,m) = e~*x +aS = (2-14) 

X L X s (a) X L X s (a) 

where it is explicitly expressed that a and m are regarded as new knowledge in 
pTob(Z n \P(Q), I, a, m). For the prior information /, we impose the criterion 

Z n > 0, (2-15) 

so that pmb(Z n < 0|J, a, m) = 0. 

The most probable image of Z n , denoted cts ; is calculated according to the 
following proceduresP^^ 

1. Maximizing W[Z] to obtain the most probable image Z n °^ for a given a: 

* W [Z] \ z=zia) = ^- (-\x 2 + as) \ z=zia) = 0. (2-16) 



6Z n L J ^=^"> SZ n V 2 
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2. Averaging Z n to obtain the a-independent most probable image Z n : 

Z n = J da Z^ a) prob(a\P(Q),I,m). (2-17) 

The range of integration is determined so that the relation 

prob(a|P(Q), I, m) > prob(a|P(Q), I, m)/10 holds, where prob(a|P(Q), I, m) 

is maximized at a = a. 

3. Error estimation: 

The error of the most probable output image Z n is calculated as the uncertainty 
of the image, which takes into account the correlations of the images Z n among 
various values of 9 n : 

((5Z n ) 2 ) = f da {(5Z^) 2 ) prob(a|P(Q),/,m). (2-18) 

Here SZ n and 5Z^ represent the error in Z n and that in Zn , respectively. 

2.4. The most probable image and the parameter a 

In the MEM formalism, a real positive parameter a is introduced. This param- 
eter plays the role of the trade-off between S and x 2 . The most probable value of 
a is determined by the posterior probability of a, piob(a\P(Q), I,m), appearing in 
Eq. (|2T7[) . The probability piob(a\P(Q), I,m) is given by 

prob(a|P(Q), I, m) = P(a) oc g(a)e w{a)+A(a) , (2-19) 

where W(a) = W[Z^} and 2A(a) = £fchiW(" + A fc (a))}. The function A(a) 
represents the contribution of fluctuations of Z(9) around Z^ a \9), and the quantities 
Afc(a) are the eigenvalues of the real symmetric matrix 

_ 1 d 2 X 2 

^n,m = 2 8Z ■^"n\z=Z ( - a ) ' (2-20) 

Here, the function g{a) represents the prior probability of a and is chosen according 
to prior information. In general, two types of g{a) are employed, one according to 
Laplace's rule, fl r Lap(o ; ) = const, and one according to Jeffrey's rule, <?j e f(a) = 1/a. 
The latter rule is determined by requiring that P{a) be invariant with respect to a 
change in scale, because a is a scale factor. The former rule means that we have no 
knowledge about the prior information of a. In general, the most probable image 
Z{9) depends on g{a). In the present study, we investigate the sensitivity of Z{9) 
to the choice of g(a). This is done by studying the quantity 



A(9) = I^LapW 5 Jcf( "" , (2-21) 

CZ L ap(0)+2jcf(0))/2 

where Z^ p (9) and Zj e f (9) represent the most probable images according to Laplace's 
rule and Jeffrey's rule, respectively. The quantity A{6) is a relative difference, which 
is defined as the absolute value of the difference between Z-^^9) and Zj e f(0) divided 
by the average of the two. 
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§3. Numerical results 



3.1. Flattening in the Monte Carlo data 

In this study, we carried out Monte Carlo simulations of the CP 3 model with the 
fixed point action. We fixed the coupling constant (3 to 3.0 and the lattice size L to 
38 and 50. The corresponding correlation length is approximately 7 in units of the 
lattice spacing. We employed the set method and the trial function method. The 
total number of measurements for each set was at least on the order of 10 6 . Parameter 
values used in the simulations are listed in Table HJ where Q m m — Qmax represents 
the range of the topological charge with which MC simulations were performed. It is 
noted that all data, except for L = 38 and 50, were obtained in a previous studyP^ 

Table I. Parameter values used in the MC simulations of the CP 3 model with the FP action. For 
the MEM analysis, new MC simulations were performed for L — 38 and 50. 



p 


L 


Qmin Q 


3.0 


12 


0-30 




24 


0-18 




32 


0-24 




38 


0-27 




46 


0-33 




50 


0-15 




56 


0-18 



total number of measurements (M/set) 



10.0 
10.0 
3.0 
5.0 
1.0 
30.0 
5.0 



Figure ^ displays the topological charge distribution P{Q) (left panel) and the 
free energy density f(8) (right panel) calculated by use of the FTM. The error in 
f{9) was calculated using the jackknife method. 
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Fig. 1. Topological charge distributions P(Q) (left panel) and free energy densities f(0) (right 
panel) of the CP 3 model for (3 — 3.0 and various lattice sizes L. 



3.5 



In the right panel of Fig. it is observed that f{6) depends on L. The free 
energy f{6) increases as a function of L. For 6 < 2.0, f(6) seems to approach an 
asymptotic line as the lattice volume increases (L > 38). For 8 > 2.0, by contrast, 
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the finite size effect in f(9) is ciearly observed. For L = 56, flattening is clearly 
seen. For L = 50, f(9) is not plotted for 9 > 2.0, because in this case negative 
values of 2(9) appear. We also call this behavior "flattening" for the same reason 
that the error in P(Q) causes the FTM to become invalid (see H2.2|) . Although the 
total number of measurements carried out here is as large as 3 x 10 7 in each set for 
L = 50, flattening is still observed. 

3.2. MEM analysis of the Monte Carlo data 

As shown in the previous subsection, f(9) exhibits flattening phenomena for 
L = 50 and 56, while it behaves smoothly for smaller volumes in the FTM. In the 
present study, we systematically study flattening in terms of the MEM. For this 
purpose, the data for L = 38 and 50 are used. 

In our analysis, three types of the default models are used: (i) m c (9) = const; 
(ii) itig(0) = exp [— 7^r# 2 ] ; (hi) m(9) = 2(9) for smaller volumes. In case (i), 
three values of m c (6), 1.0, 1.0 x 10~ 3 and 1.0 x 10~ 5 , are employed, and only the 
results for m c (9) = 1.0 are presented. Case (ii) is the Gaussian default model, and 
the parameter 7 in mo,(9) is varied over a wide range in the analysis. In case (iii), 
to analyze the data for the lattice size Lq, images 2(9) obtained from the MEM 
analysis for smaller volumes are used as default models. This is because we believe 
that 2(8) for smaller volumes might have properties similar to those for Lq. Such 
2(9) may be regarded as prior information. For Lq = 50, 2(9) for L = 24, 32 and 
38 are employed as the default models. These are denoted as m L / Lo (9) = rn L / 50 (8). 
Throughout this paper, it is understood, unless otherwise stated, that Laplace's 
rule is used for g(a). The number Nq is so chosen that P(Q) > 10~ 18 holds for 
L = 38 and P(Q) > 10~ n holds for L = 50. The function x 2 in Eq. (l2~TTj) is 
given in terms of the inverse covariance matrix of the MC data {P(Q)}. The inverse 
matrix is calculated with such precision that the product of the covariance matrix 
and its inverse has off-diagonal elements that are no larger than 0(W~ 30 ). With 
these conditions, the value of Nq is 5 for L = 38 and 7 for L = 50. It is noted that 
the analysis was performed with quadruple precision in order to properly reproduce 
2(9), which ranges over many orders. 

3.2.1. Non-flattening case 

Firstly, we apply the MEM to the data without flattening (L = 38). The left 
panel of Fig. [^displays 2^ a \0) for a given a. Here, m c (9) and mo,(9) with 7 = 1.0 
are used as the default models. The partition function obtained using the FTM, 
^Four(^)) is also plotted for comparison. For each m(9), the a dependence of 2^(9) 
is almost indiscernible for a £ [10, 100]. It is seen that all the images have only very 
weak dependence on m(9) over the entire range of values of 9. It is also observed 
that all the results of the MEM are consistent with those of the FTM. In the right 
panel, it is seen that P(a) depends on m(8). The peaks of all the P(a) are located in 
the range a < 50. The most probable image 2(9) is calculated by using Eq. (|2-17j) . 
The integrals over a are trivial, because 2^ (8) depends on a only very weakly over 
the range of integration. Thus, it is expected that the values of 2(8) for various 
m(8) agree with those of 2-p om (8) over the entire range of values of 9. In fact, we 
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Fig. 2. Images Z^ a '(6) for a given a (left panel) and probabilities P(a) in the non-flattening case 
(L — 38). In the left panel, m c (8) and t?ig(#) with 7 = 1.0 are used. In addition, mG(6) with 
7 = 0.6 and 1.2 are used in the right panel. 



find that all the 2(9) are equal to 2p OUT (9) within numerical errors. The errors are 
calculated according to the procedure 3 outlined in ^ 12.31 Some of these results are 
listed in Table ITU specifically, the values of 2(9) for various itiq(9) and 2p our (9) at 
three values of 9. These values of 8 are chosen as representatives; the first value is 
9 = 2.00. Up to this value, the asymptotic line of f(9) is observed (see Fig. UJ). The 
second one is 9 = 3.14 chosen as a value near tt. The third one is 9 = 2.60, chosen 
as a value approximately halfway between these two. It is noted that the errors in 
2(9) are rather small for the entire range of values of 9. 



Table II. Values of the most probable images Z(9) at 6 = 2.00, 2.60 and 3.14. As default models, 
maid) with 7 = 0.6, 1.0 and 1.2 were used. For comparison, ^Four(^) is also listed. 



e 


•ZFour ($) 


Z-y=o.e(6) 


2y=l. O (0) 


Z J= i. 2 (0) 


2.00 

2.60 
3.14 


2.840(46) xlO" 2 
0.675(47) xl0~ 2 
0.367(51) xlO -2 


2.846(100) xlO" 2 
0.696(57) xlO -2 
0.343(52) xl0~ 2 


2.844(81) xlO" 2 
0.707(46) xlO" 2 
0.322(42) xlO" 2 


2.844(73) xlO" 2 
0.712(41)xl0" 2 
0.312(37) xlO" 2 



The calculations discussed above were performed using Eq. l)2-17p with Laplace's 
rule. When Jeffrey's rule is employed, the peak of P(a) appears at a value of a 
smaller than that in the case of Laplace's rule. The probability Ph&p(ot) for 7 = 1.2, 
for example, peaks at a = 50, while Pj e f(a) peaks at a = 35. (Here, PLap(«) and 
-Pjef(oO represent P(a) for Laplace's and Jeffrey's rules, respectively. ) Although 
-Flap (a) and Pj e f (ct) peak at different values of a, similar images for 2(9) are obtained 
with slightly different errors, because 2^ a \9) is almost independent of a [see Eq. 
()2-17|) ]. We, thus, find in the non-flattening case that the MEM gives most probable 
images that are almost independent of the prior information and is consistent with 
the FTM. 
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3.2.2. Flattening case 

Now that we have found that the MEM is applicable to the analysis of the MC 
data, let us turn to the analysis of data with flattening (L = 50). In Fig. [U we 
show 2p OUT (9) obtained using the FTM for data with 30.0M/set. Although 2p OUT (9) 
behaves smoothly, its errors are large over a large range of values of 9 (specifically 
for 9 > 2.4). These large errors result from the error propagation of P(Q) through 
the Fourier transform. Let us here consider its effect by studying a quantity which 
represents the error propagation of P(Q) in the case that there is no correlation of 
the distribution P(Q) for different values of Q, 

e = £|4P(Q)|. (3-1) 

Q 

Here, e = 3.610 x 10 -4 , and the value of 2p OUT (9) is comparable with that of e at 
9 ~ 2.4. Figure |21 displays that when the value of 2-p onv (9) is smaller than that of e, 
the error in 2p OUT (9) becomes large. This approximately holds for all cases we have 
investigated. In the MEM, e could be an indicator for the influence of the error of 
P(Q) on 2(9). This point is discussed in the following. In the analysis of data with 
flattening, much care is required.^ 

In order to properly evaluate 2(9) obtained using the MEM, we carefully inves- 
tigate (i) the statistical fluctuations of 2(9), (ii) the g(a) dependence of 2(9), and 
(iii) the relative error in 2(9). 



1 
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N 0.001 
0.0001 
1e-05 
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e 

Fig. 3. Partition function Zf out (9) obtained using the FTM. The number of measurements is 
30.0M/set. The arrow indicates the value of e (= 3.610 x 1CT 4 ). The errors were calculated 
with the jackknife method. 

(i) The statistical fluctuations of 2(9) 

Figure 0] displays five images 2(9), which are called Data A, B, C, D and E, 
respectively. These images were calculated from the data with 2.0M/set, which are 
independent of each other. The uncertainties in 2(9) calculated with Eq. ()2T8|) are 
indicated as errors in Fig. |1J As a default model, the Gaussian default with 7 = 5.0 



Fourier x 
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Fig. 4. The five most probable images Z(9) for L — 50 with 2.0M/set. Here, the Gaussian default 
model with 7 = 5.0 was used. The arrow indicates the value of e (= 1.441 x 10~ 3 ) in Data A. 
The errors in Z{6) were calculated by use of Eq. (12-181 1. 



were used. It is seen that all the images 2(9) fall on the same curve for 9 < 2.0, 
while they behave differently for 9 > 2.0. The value of e is at least 1.3 x 10 -3 for 
these five sets of data, and the values of all the images 2(9) are smaller than that 
of e for 9 > 2.0. Figure 0] indicates that 2(9) fluctuates greatly when the value of 
2(9) is smaller than that of e. 

To see how 2(9) depends on the statistics, we varied the number of measure- 
ments. We fixed the value of 9 and chose the values 9 = 2.31 and 3.14. The value 
9 = 2.31 was chosen for the previously stated reason that Zp our (9) starts to contain 
large errors at this value for the data with 30.0M/set (see Fig. |HJ). Figure [5] displays 
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Fig. 5. Values of 2(6) for 9 — 2.31 (left panel) and 3.14 (right panel). The horizontal axis represents 
the number of measurements. Here, the Gaussian default model with 7 = 5.0 was used. 
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2(9) for 9 = 2.31 (left panel) and 3.14 (right panel). The horizontal axis repre- 
sents the number of measurements. In the left panel, it is seen that the fluctuations 
of 2(2.31) become smaller as the number of measurements increases. In the right 
panel, it is seen that 2(9) for 15.0M/set fluctuates as greatly as 2(9) for 2.0M/set. 
In Table ITTT1 the values of 2(9) for 9 = 2.31, 2.60 and 3.14 are listed for the cases 
of 20.0M/set, 26.0M/set and 30.0M/set. For 9 = 2.31 and 2.60, the images 2(9) do 
not vary within the errors as the number of measurements increases. For 9 = 3.14, 
the value of 2(9) varies significantly as the number of measurements increases. In 
the cases 9 = 2.31 and 2.60, the values of 2(9) are comparable with that of e for 
30.0M/set (e = 3.610 x 10" 4 ), while in the case 9 = 3.14, the value of 2(9) is one 
order smaller than e. This indicates that when the value of 2(9) is smaller than that 
of e, 2(9) is strongly affected by the error in P(Q). Hereafter, we concentrate on 
the data with 30.0M/set. 

Table III. Values of Z(9) for 6 = 2.31, 2.60 and 3.14. Here, the same default model as in Fig. ^Jis 
used. 



statistics(M / set) 


,Z 7= 5.o(2.31) 


^ 7=5 .o(2.60) 


2^=6.0(3.14) 


20.0 


4.07(39) xlO -4 


2.30(31) xlO" 4 


3.93(22) xl0~ b 


26.0 


4.22(40) xl0~ 4 


2.69(33) xlO" 4 


5.83(26) xlO" 5 


30.0 


4.58(40) xlO -4 


2.81(33) xl0~ 4 


5.37(26) xlO -5 



(ii) The g(a) dependence of 2(9) 

The most probable image 2(9) is obtained by performing the integral with re- 
spect to a according to the procedure 2 outlined in M2.31 The probability P(a) in Eq. 
(|2-19|) involves the prior probability of a, g(a). In the present study, we investigate 
the g(a) dependence of 2(9) by calculating Eq. (J2-21|) . 

Before calculating A(9), we investigate how g(a) affects the behavior of P(a). 
From the definition given in H2.41 the following relation holds, up to the normalization 
constant between Phapfa) and Pj e f(a): 



Pjef(a) oc fl r jef(a)^La P (a) = --PLap(a)- 

a 



(3-2) 



The probability gj c [(a) deforms the shape of PLap(a) and shifts the location of 
its peakPJ It was shown in Ref. I12[) that when Ph&p(ot) is concentrated around 
its maximum at a = &, the peaks of PLap(a) and Pj e f(a) are located at nearly 
equal values of a, and 2(9) is insensitive to the choice of g(a). We quantitatively 
determine the amount by which gj e f (a) shifts the location of the peak of -PLap(a) in 
the following. 

The derivative of P(a) with respect to a, 



dP(a) 
da 



J_ \ - Afc(a) 
2a ^ a + \ k (a) 



d\ng(a 

+ b(a) H : 

da 



l d X 2 (a) | a dS(a) 
2 da da 



d\ k (a) 



2 ' a + Xk(a) da 



P(a), (3-3) 
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is vanishing at a = a. It is noted that x 2 (o0> S(a) and {Afc(a)} depend implicitly 
on a through Z( a \8), calculated using Eq. 1)2-16(1 . We obtain 

aLap5'(oLap) = ~~^^ g + [derivative terms], (Laplace's rule) (3-4) 

& j e f S(a j e f) = —-z(N g — 2) + [derivative terms], (Jeffrey's rule) (3-5) 

where "derivative terms" represents the derivatives of x 2 (o) 5 S(a) and Afc(a), and 
«Lap and dj e f denote a for Laplace's and Jeffrey's rules, respectively. Here, N g = 
^ fc Afc(a)/(d + Afc(d)), and N g represents the number of effectively independent 
measurements, because the value of the Afc(d) contributes approximately 1 to the 
summation when \k(a) 3> a. Note that since the quantities A^(a) are independent 
of the choice of g(a), N g is also insensitive to g(a). For simplicity, let us ignore these 
"derivative terms" for the time being. With the relative difference between the two 
values of a, 

"Lap - djef / q /i \ 

r& = -z , (3-6) 

"Lap 

we obtain, from Eqs. (|3-4() and (|3-5|) . 

r& = (3-7) 

because the derivative of S(a) is vanishing and ^(aLap) — S(&j e f). Hence, the 
relative difference between the values of a, r^, to good approximation depends only 
on N g . When N g is larger than 2, r& is negligible, and the peaks of PLap( a ) an d 
Pj e f(a) could be located at almost equal values of a. In case that the "derivative 
terms" cannot be ignored, however, using only N g , one cannot properly estimate to 
what extent the location of the peak moves. In such a case, we need to resort to 
numerical calculations. 

Let us investigate the behavior of P{a) in our data; specifically, we check whether 
the derivatives of x 2 ( a )) S(a) and {Afc(a)} are negligible and determine the distance 
between the peaks of PL a p{a) and Pj e f(a). The values of d, aS(a), N g /2 and 
\&S(a) + N g /2\ = D for various mc(9) are listed in Table HVI in the case of Laplace's 
rule. As shown in Eq. (|3-4|) . the value of D provides a criterion for determining 
whether or not the "derivative terms" can be ignored. 

Firstly, we focus on the 7 = 5.0 case. In Eq. (|3-4|) . D makes a smaller contribu- 
tion to dLapS'(d'Lap) than does N g /2 (D ~ 0.04 x N g /2), and hence the "derivative 
terms" can be ignored. It is thus expected that Eq. (|3-7|) holds. Substituting o?Lap 
and N g in Table IP7I into Eq. (l3~71) . we obtain dj f ~ 287. With Jeffrey's rule, on 
the other hand, Pj e f(a) gives dj e f = 293. We thus find good agreement between 
the values of dj e f obtained in these two ways. Next, we determine the distance be- 
tween the peaks of -PLap(a) and Pj e f(a). Because the value of N g is comparable to 
2 (r& ~ 0.27), it is expected that the peaks of Phnp(a) and Pj e f(a) could appear at 
somewhat separated values of a. It is shown that this is indeed the case in the left 
panel of Fig. H3 
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Now let us consider the other cases of the default model mc(fl) in Table llVl 
The values of D are larger than that for 7 = 5.0: we have D ~ 0.67 x N g /2, 
D ~ 0.18 x Ng/2 and D ~ 0.30 x N g /2 for 7 = 8.0, 10.0 and 13.0, respectively. In 
these cases, the "derivative terms" are not negligible, and Eq. (|3-7|) no longer holds. 
The probabilities P(a) for Laplace's and Jeffrey's rules give r& ~ 0.20, 0.14 and 0.13 
for 7 = 8.0, 10.0 and 13.0, respectively. The right panel of Fig. El displays both 
-PLap(a) and Pj e f(a) for 7 = 13.0. In this figure, the peaks of Phap{ot) and Pj e f(a) 
are seen to be located at different values of a. Similar behavior of P(a) is obtained 
for the other default models m(9). 



Table IV. Values of 

S^aLap), and D(= l&LapS^aLap) + N g /2\) for various uig{Q) 

with Laplace's rule. Here, Nq — 7. 



7(Gaussian default) 


«Lap 


«Lap5'(ciLap) 


-NJ2 


laLapSKap) + iV s /2| = D 


5.0 


401 


-3.358 


-3.498 


0.140(~ 0.04 x N g /2) 


8.0 


10200 


-0.992 


-3.046 


2.054(~ 0.67 x Ng/2) 


10.0 


1400 


-2.653 


-3.232 


0.579(~ 0.18 x Ng/2) 


13.0 


460 


-4.419 


-3.410 


1.009(~ 0.30 x Ng/2) 
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Fig. 6. PLap(a) and Pj e f(a). As the default models, Gaussian functions with 7 = 5.0 (left panel) 
and 13.0 (right panel) were used. 

Let us now study Z{9). The a-dependent image Z^ a \6) affects the behavior of 
2(6) through the integral in Eq. I|2-17|) : 



Z(9) 



J da P^pi^Z^iO), (Laplace's rule) 
/ da P Je{ (a)Z^(9). (Jeffrey's rule) 



(3-8) 



If Z^ a \d) does not vary over the range of integration in Eq. (|3-8|) . Z^ a \9) can be 
factored out from the integral and we have Z^ a _ p (8) = Zj e f(0), due to the normal- 
ization of P(a). This is indeed the case for 7 = 5.0: for 6 = 2.60, for example, 
the values of Z^(0) are 2.831 x 10~ 4 for a = 50 and 2.797 x 10~ 4 for a = 1050. 
The latter value of a is the upper limit of a, a max , and the former one is the lower 
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limit of a, a m i n , in the integral. For the other default models listed in Table HVl 
by contrast, all the 2^(9) vary over several orders for 9 > 2.6. For 7 = 13.0, for 
example, the values of 2^(2.60) are 2.80 x 10~ 4 and 9.55 x 10~ 7 for a = a min = 100 
and a = a max = 910, respectively. From these results, it is conjectured that 2(6) is 
almost independent of g(a) for 7 = 5.0 and depends strongly on g(a) for the other 
values of 7. In fact, no g(a) dependence is seen over the entire the range of values of 
for 7 = 5.0, as shown in the left panel of Fig. Q while in the right panel (depicting 
the situation for 7 = 13.0), differences between these two are observed for 9 > 2.6. 
Note that for 9 < 2.0, 2^ ap (9) and Zj c f(9) fall on the same curve in both the 7 = 5.0 
and 13.0 cases. 
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Fig. 7. ^Lap(^) and Zj e f(9) for 9 £ [2.0, n]. The default models used here are the same as in Fig. 
|S| The errors in 2(8) were calculated using Eq. 12T8I . 



3.2 



In order to estimate the influence of g(a) on 2(9), we calculate A(9) given by 
Eq. (|2-21|) . For a systematic estimation, the parameter 7 in ttiq(9) is varied from 
3.0 to 13.0. Figure |H] displays the values of A(9) at 9 = 2.60. The horizontal axis 
represents the value of 7. It is found that the value of A(9) is smallest for 7 = 5.0 
and becomes larger as the value of 7 deviates from 5.0. Table |V| lists the values of 
A(9) for eight images among these with 9 = 2.31, 2.60, 2.83 and 3.14. In the case 
of 7 = 5.0, the value of A(9) increases with 9 for 9 > 2.6. This behavior is also 
observed for the other m(9). It is found that the values of A(9) for 7 = 3.0, 4.0, 5.0, 
m 24/5o(^)i m 32/5o($) an d ^38/5o(^) are quite small over the entire range of values of 
9 and that the images 2^(9) for these six depend only very weakly on a over the 
range of integration in Eq. (|2-17|) . 

We now give a brief comment on e in Eq. (|3-1|) , The difference between 2-^^(9) 
and -Ej e f (9) becomes significant at the value 9 = 9 e , where 2(9 e ) ~ e is satisfied, and 
A(9 e ) ~ 0.1 holds; e.g., for 7 = 13.0, we have 9 e ~ 2.3 (see Fig. [7J). This is true for 
all cases in which 7 > 7.0. 

(iii) The relative error of 2(9) 

Now that the g(a) dependence of 2(9) has been systematically investigated, we 
next consider the uncertainty in 2(9). 
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Fig. 8. Values of A(9) for 9 — 2.60. The Gaussian functions are used as the default models. The 
horizontal axis represents the value of 7 in ma{&). 



Table V. Values of A{9) at 9 = 2.31, 2.60, 2.83 and 3.14 for various m(0). 



default model 


4(2.31) 




4(2.60) 




4(2.83) 




4(3.14) 




ma(0) with 7 = 


= 3.0 


6.30x10" 


■A 


1.30x10" 


2 


1.87x10" 


2 


2.24x10" 


2 


?71g(0) with 7 = 


= 5.0 


5.35x10" 


3 


1.41x10" 


3 


1.12x10" 


2 


1.84x10" 


2 


mo(9) with 7 = 


= 8.0 


1.03x10" 


1 


1.50x10" 


1 


1.70x10" 


1 


1.75x10" 


1 


mo{0) with 7 = 


10.0 


1.01x10" 


1 


4.67x10" 


1 


8.21x10" 


1 


9.60x10" 


1 


ma(0) with 7 = 


13.0 


1.05x10" 


1 


4.96x10" 


-1 


7.12x10" 


1 


7.95x10" 


1 


"124/50 (#) 




3.44x10" 


3 


7.67x10" 


-3 


1.13x10" 


2 


1.36x10" 


2 


^32/50(0) 




7.30x10" 


3 


1.57x10" 


2 


2.29x10" 


2 


2.72x10" 


2 


»n 3 8/ 6 o(0) 




1.30x10" 


2 


2.60x10" 


-2 


3.65x10" 


2 


4.26x10" 


2 



Table VI. Values of \8Z\/Z(9) = \5Z(9)\/Z(9) for 9 = 2.31,2.60,2.83 and 3.14. Here, the same 
default models as in Table El were used. 



default model 


\SZ\/Z(2.31) 


\5Z\/Z(2.60) 


\SZ\/Z(2.83) 


\SZ\/Z(3.U) 


tug (9) with 7 = 


3.0 


1.57xl0 _i 


2.31xl0 _1 


2.91xl0 _1 


6.24x10" 


1 


ma(6) with 7 — 


5.0 


8.82xl0~ 2 


1.17X10"" 1 


1.44X10" 1 


4.58x10" 


1 


mo(9) with 7 — 


8.0 


2.21xl0~ 2 


2.15X10" 1 


1.93 


56.84 




mo(9) with 7 — 


10.0 


3.48 xlO -2 


3.88X10" 1 


1.86 


40.69 




mo(9) with 7 = 


13.0 


6.88xl0~ 2 


2.35xl0 _1 


4.93xl0 _1 


11.94 




"124/50 (0) 




2.70xl0 _1 


4.82X10" 1 


6.54X10" 1 


8.31x10" 


1 


m 32 /Bo(9) 




1.67xl0 _1 


2.73X10"" 1 


3.76xl0 _1 


4.82x10" 


1 


"138/50 (8) 




1.03xl0 _1 


1.57xl0 _1 


2.12xl0 _1 


2.68x10" 


1 



The relative errors in 2(9), \62(9)\/2(9), are displayed in Fig. |§J where 52(6) 
is calculated using Eq. ()2-18[) . For comparison, \52f out (9)\/2f out (9) is also plotted. 
It is observed that all the relative errors increase with 6. In particular, those for 
7 = 8.0, 10.0 and 13.0 diverge for large 9 (9 > 2.6). The value of \52(9)\/2(9) for 
m 24/5o($) is comparable with that of \52-F onr (9)\/ 2p ovir (9), and those for the others 
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Fig. 9. Values of \8Z{6)\/ Z{9) for the selected default models. The results obtained with the 
Fourier method are also plotted. 



are smaller than that of \5Zf out (9)\/ 'ZF our (9). To see in detail how \5Z(9)\/Z(9) 
varies in the large 9 region, we list \5Z(6)\/Z{6) with 9 = 2.31, 2.60, 2.83 and 3.14 
in Table IVTI for various m{9). The relative errors \5Z{9)\/ Z{9) for 7 = 8.0, 10.0 and 
13.0 increase rapidly with 9 and exceed 1.0 for 9 > 2.8. It is seen in Table IVTI that 
for 7 = 5.0, wi32/5o($) an d wi38/5o($)> the images Z{9) have small uncertainties. 

As seen in (ii), some of the images Z^ a \9) vary over several orders as a varies, 
and the images Z{9) calculated from these Z^ a \9) depend strongly on g(a) for large 
values of 9. For these images, the relative errors are large. For 7 > 7.0, Z{9) is 
such that A(9) > 0.1 for 9 > 9 e and \5Z(9)\/Z(9) > 1.0. Contrastingly, the other 
images Z^ a \9) [7 < 6.0 and m^/^^O)] do not vary over the range of integration in 
Eq. (|2-17|) . and these Z{9) are independent of g(a), with A(9) < 0.1 over the entire 
range of 9. The relative errors \5Z{9)\/ Z{9) for these cases [7 < 6.0 and wiz^oC^)] 
increase with 9 but do not exceed 1.0. This indicates that the uncertainty reflects 
the prior information dependence of the most probable image. 

Summarizing the above analysis, we present the results for Z(9) in Fig. ^3 
for various m(9). All the results obtained using the MEM behave smoothly over 
the entire range of 9. For 9 < 2.3, these eight images fall on the same curve, and 
the MEM reproduces images consistent with the FTM. For 9 > 2.5, by contrast, 
the m(9) dependence of Z{9) is clearly seen. In the large 9, Z{9) for 7 = 8.0 and 
13.0, which possess large errors, decrease over several orders as 9 increases, while 
the other images with small errors gradually decrease as 9 increases. Each of these 
images obtained in Fig. ^3 could be a candidate for the true image. With the 
observation that Z{9) depends strongly on m{9) in the region where the values of 
Z{9) are smaller than that of e, the m(9) dependence of Z{9) reflects the flattening 
phenomenon in the FTM. 
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Fig. 10. The most probable images Z(9) for various m(8). The arrow indicates the value of e (= 
3.610 x 10 -4 ). 



§4. Conclusions and discussion 



In this paper, we have applied the MEM to the MC data of the CP 3 model. We 
have studied how the flattening phenomenon is observed with the MEM. For this 
purpose, two types of data were used, that for L = 38, in which no flattening is 
observed, and that for L = 50, in which flattening is reproduced through the Fourier 
transform. 

The results we obtained in the present study are the following. 

1. In the case without flattening, the MEM yielded images 2(9) that are almost 
independent of m(9) and g(a). The most probable images 2(9) are in agreement 
with the result of the FTM within the errors (see Fig. |2] and Table [H} . 

2. In the case with flattening, we have systematically checked (i) the statistical 
fluctuations of 2(9), (ii) the g(a) dependence of 2(9) and (iii) the relative error 
of 2(9). We found that the statistical fluctuations of 2(9) become smaller as 
the number of measurements increases except near 9 = it. We also found that 
2(9) with large errors depends strongly on g(a) in the region of large 9, where 
the g(a) dependence of 2(9) was estimated using the quantity A(9). Our results 
are summarized in Fig. I1U1 All the results obtained using the MEM behave 
smoothly over the entire range of 9. In the region where the value of 2(9) is 
larger than or nearly equal to that of e, (specifically 9 < 2.3), final images fall 
on the same curve, and the MEM reproduces images consistent with the FTM. 
Contrastingly, in the region where the value of 2(9) is smaller than that of 
e, 2(9) depends strongly on m(9). This m(9) dependence of 2(9) reflects the 
flattening phenomenon. Each of these images obtained in Fig. EH could be a 
candidate for the true image. 



The Sign Problem and MEM in Lattice Field Theory with the 9 Term 



19 



In the present study, e given by Eq. <|3-lj) turns out to be an approximate 
indicator of the influence of the error in P(Q) to 2(9) in both the FTM and MEM 
cases. As seen in (ii) presented in ^3.2.21 2(9) starts to exhibit the g(a) dependence 
at 9 = 9 e for m G (0) with 7 > 7.0, where 9 e is defined by 2(0 e ) ~ e, and A(9 e ) ~ 0.1 
holds. The other default models investigated here satisfy A(9) < 0.1 for all 9. For 
these, the g(a) dependence is very weak, even if 2(9) < e. It is worthwhile to study 
the reason for this. 

The magnitude of the relative error depends on m(9) for large 9 (see Fig. |§J). 
At 9 = 9 e , the FTM gives \62 Fom (9 e )\/2 Fom (9 € ) ~ 0.3, while \62(9 e )\/2(9 € ) ~ 0.1 
holds in some cases in the MEM; 2(9) for 7 = 3.0, 4.0, 5.0, 6.0 and m 38 / 50 (9). 
Although 52(9) is the uncertainty in the image, it is necessary to elucidate the 
different manners in which the error in P(Q) affects 2(9) and 2p OUT (9). When 2(9) 
depends strongly on m(9), each 2(9) could be a candidate for the true image. If we 
had proper knowledge about m(9) as prior information, we could identify the true 
image in a probabilistic sense. Such analysis may allow us to clarify the relationship 
between the default model dependence and the systematic error. 

The MEM provides a probabilistic point of view in the study of theories with 
the sign problem. The canonical approach^) in the study of lattice field theory with 
a finite density exhibits a formal correspondence to lattice field theory with the 9 
term. Noting this correspondence, it may be worthwhile to study lattice QCD with 
a finite density in terms of the MEM from a probabilistic point of view. 
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